ASSET Queries: A Set-Oriented and Column-Wise Approach to Modern OLAP
نویسندگان
چکیده
Modern data analysis has given birth to numerous grouping constructs and programming paradigms, way beyond the traditional group by. Applications such as data warehousing, web log analysis, streams monitoring and social networks understanding necessitated the use of data cubes, grouping variables, windows and MapReduce. In this paper we review the associated set (ASSET) concept and discuss its applicability in both continuous and traditional data settings. Given a set of values B, an associated set over B is just a collection of annotated data multisets, one for each b∈B. The goal is to efficiently compute aggregates over these data sets. An ASSET query consists of repeated definitions of associated sets and aggregates of these, possibly correlated, resembling a spreadsheet document. We review systems implementing ASSET queries both in continuous and persistent contexts and argue for associated sets’ analytical abilities and optimization opportunities.
منابع مشابه
A cubic-wise balance approach for privacy preservation in data cubes
A data warehouse stores current and historical records consolidated from multiple transactional systems. Securing data warehouses is of ever increasing interest, especially considering areas where data are sold in pieces to third parties for data mining practices. In this case, existing data warehouse security techniques, such as data access control, may not be easy to enforce and can be ineffe...
متن کاملSAP HANA - The Evolution of an In-Memory DBMS from Pure OLAP Processing Towards Mixed Workloads
The journey of SAP HANA started as an in-memory appliance for complex, analytical applications. The success of the system quickly motivated SAP to broaden the scope from the OLAP workloads the system was initially architected for to also handle transactional workloads, in particular to support its Business Suite flagship product. In this paper, we highlight some of the core design changes to ev...
متن کاملLate-Materialization using Sort-merge Join Algorithm
1. Daniel J. Abadi ,Daniel S. Myers, David J. DeWitt, Samuel R. Madden. Materialization Strategies in a Column-Oriented DBMS. Proceedings of ICDE 2007, Istanbul, Turkey. 2. Daniel J. Abadi, Samuel R. Madden, Nabil hachem. Column-Store vs Row-store How different are they really? SIGMOD’08, June 9–12, 2008, Vancouver, BC, Canada 3. Daniel Abadi,PeterBoncz,Stavros Harizopoulos, Stratos Idreos, Sam...
متن کاملUsing the column oriented NoSQL model for implementing big data warehouses
The column-oriented NoSQL (Not Only SQL) model provides for big data the most suitable model to the data warehouse and the structure of multidimensional data as the OLAP cube and allows it to be deployed in the cloud and a high scalability whilst delivering high performance. In the absence of a clear approach which allows the implementation of data warehouses using this model, we propose in thi...
متن کاملParallel Replication across Formats in SAP HANA for Scaling Out Mixed OLTP/OLAP Workloads
Modern in-memory database systems are facing the need of efficiently supporting mixed workloads of OLTP and OLAP. A conventional approach to this requirement is to rely on ETL-style, application-driven data replication between two very different OLTP and OLAP systems, sacrificing realtime reporting on operational data. An alternative approach is to run OLTP and OLAP workloads in a single machin...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009